Ranking Search Intents Underlying a Query
Abstract
Observations of search engine query logs indicate that queries are usually ambiguous. As with document ranking, the search intents underlying a query should be ranked to facilitate information search. Previous work ranks intents by relevance score alone. We argue that diversity is also important. In this work, we propose unified models that rank the intents underlying a query by combining a relevance score with a diversity degree; the latter is reflected by the non-overlapping ratio of each intent and the aggregated non-overlapping ratio of a set of intents. Three conclusions are drawn from the experimental results. First, diversity plays an important role in intent ranking. Second, URLs are more effective than similarity in detecting unique subtopics. Third, the aggregated non-overlapping ratio contributes somewhat to similarity-based intent ranking but little to URL-based intent ranking.
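The abstract does not give the models' formulas, so the following is only a minimal sketch of the general idea of combining relevance with a URL-based non-overlapping ratio. It uses a greedy, MMR-style selection, which is a stand-in technique and not necessarily the authors' method. The names `non_overlap_ratio`, `rank_intents`, the weight `lam`, the linear combination, and the example data are all assumptions made for illustration.

```python
from typing import Dict, List, Set, Tuple


def non_overlap_ratio(urls: Set[str], covered: Set[str]) -> float:
    """Fraction of `urls` not already present in `covered` (assumed definition)."""
    if not urls:
        return 0.0
    return len(urls - covered) / len(urls)


def rank_intents(
    intents: Dict[str, Tuple[float, Set[str]]], lam: float = 0.5
) -> List[str]:
    """Greedily rank intents by lam * relevance + (1 - lam) * novelty.

    `intents` maps an intent label to (relevance score, set of result URLs).
    Novelty is the non-overlapping ratio of the intent's URLs against the
    URLs of all intents ranked so far. Both the weighting and the greedy
    scheme are illustrative assumptions, not the paper's models.
    """
    remaining = dict(intents)
    covered: Set[str] = set()
    ranking: List[str] = []
    while remaining:
        best = max(
            remaining,
            key=lambda name: lam * remaining[name][0]
            + (1 - lam) * non_overlap_ratio(remaining[name][1], covered),
        )
        ranking.append(best)
        covered |= remaining.pop(best)[1]
    return ranking


# Hypothetical intents for the ambiguous query "jaguar".
example = {
    "jaguar car": (0.9, {"u1", "u2", "u3"}),
    "jaguar car price": (0.8, {"u1", "u2"}),
    "jaguar animal": (0.6, {"u4", "u5"}),
}

diverse = rank_intents(example, lam=0.5)
relevance_only = rank_intents(example, lam=1.0)
```

With `lam=1.0` the ranking falls back to relevance alone; with `lam=0.5` the less relevant but non-overlapping "jaguar animal" intent moves ahead of "jaguar car price", whose URLs are already covered by "jaguar car".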
Similar resources
A Bipartite Graph-Based Ranking Approach to Query Subtopics Diversification Focused on Word Embedding Features
Web search queries are usually vague, ambiguous, or tend to have multiple intents. Users have different search intents while issuing the same query. Understanding the intents through mining subtopics underlying a query has gained much interest in recent years. Query suggestions provided by search engines hold some intents of the original query, however, suggested queries are often noisy and con...
Understanding the Query: THCIB and THUIS at NTCIR-10 Intent Task
Understanding the intent underlying a search query has recently attracted enormous research interest. Two challenging issues are worth noting: First, words within a query are usually ambiguous, while the query in most cases is too short to disambiguate. Second, ambiguity in some cases cannot be resolved according merely to the limited query context. It is thus demanded that the ambiguity be resolved/analyzed w...
Towards Supporting Exploratory Search over the Arabic Web Content: The Case of ArabXplore
Due to the huge amount of data published on the Web, the Web search process has become more difficult, and it is sometimes hard to get the expected results, especially when the users are less certain about their information needs. Several efforts have been proposed to support exploratory search on the web by using query expansion, faceted search, or supplementary information extracted from exte...
HULTECH at the NTCIR-10 INTENT-2 Task: Discovering User Intents through Search Results Clustering
In this paper, we describe our participation in the Subtopic Mining subtasks of the NTCIR-10 Intent-2 task, for the English language. For this subtask, we experiment with a state-of-the-art algorithm for search results clustering, the HISGKmeans algorithm, and define the users’ intents based on the cluster labels following a general framework. From the Web snippets returned for a given query, our fram...
The Report on Subtopic Mining and Document Ranking of NTCIR-9 Intent Task
In this paper we report our approach and result as a participant of the NTCIR-9 Intent task. INTENT task is a new NTCIR task which consists of two subtasks: (1) Subtopic Mining subtask: given a query, a system lists all possible subtopics that might cover users’ different intents. Our approach is mining the query log to find subtopics candidates and rank them according to the frequencies of eac...